首页> 外文OA文献 >Adversarial Network Bottleneck Features for Noise Robust Speaker Verification
【2h】

Adversarial Network Bottleneck Features for Noise Robust Speaker Verification

机译:噪声鲁棒音箱的对抗性网络瓶颈特征   验证

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper, we propose a noise robust bottleneck feature representationwhich is generated by an adversarial network (AN). The AN includes two cascadeconnected networks, an encoding network (EN) and a discriminative network (DN).Mel-frequency cepstral coefficients (MFCCs) of clean and noisy speech are usedas input to the EN and the output of the EN is used as the noise robustfeature. The EN and DN are trained in turn, namely, when training the DN, noisetypes are selected as the training labels and when training the EN, all labelsare set as the same, i.e., the clean speech label, which aims to make the ANfeatures invariant to noise and thus achieve noise robustness. We evaluate theperformance of the proposed feature on a Gaussian Mixture Model-UniversalBackground Model based speaker verification system, and make comparison to MFCCfeatures of speech enhanced by short-time spectral amplitude minimum meansquare error (STSA-MMSE) and deep neural network-based speech enhancement(DNN-SE) methods. Experimental results on the RSR2015 database show that theproposed AN bottleneck feature (AN-BN) dramatically outperforms the STSA-MMSEand DNN-SE based MFCCs for different noise types and signal-to-noise ratios.Furthermore, the AN-BN feature is able to improve the speaker verificationperformance under the clean condition.
机译:在本文中,我们提出了一种由对抗网络(AN)生成的噪声鲁棒瓶颈特征表示。 AN包含两个级联网络,一个编码网络(EN)和一个判别网络(DN)。干净和嘈杂的语音的梅尔频谱倒谱系数(MFCC)用作EN的输入,而EN的输出用作强大的噪音功能。依次训练EN和DN,即在训练DN时,选择噪声类型作为训练标签,而在训练EN时,所有标签都设置为相同,即干净的语音标签,目的是使ANfeatures不变噪声,从而达到噪声鲁棒性。我们在基于高斯混合模型-通用背景模型的说话人验证系统上评估提出的功能的性能,并与通过短时频谱幅度最小均方误差(STSA-MMSE)和基于深度神经网络的语音增强功能增强的MFCC语音功能进行比较。 (DNN-SE)方法。 RSR2015数据库上的实验结果表明,针对不同的噪声类型和信噪比,拟议的AN瓶颈功能(AN-BN)明显优于基于STSA-MMSE和DNN-SE的MFCC。此外,AN-BN功能能够在干净的条件下提高扬声器的验证性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号